Polishing a shiny Higgs with matrix elements 



IPPP/12/86 
DCPT/12/172 



Jeppe R. Andersen, Christoph Englert, and Michael Spannowsky 



(N 

o 

(N 

> 
O 



9-; 

a; 



o 



(N 



X 



Institute for Particle Physics Phenomenology, 
Department of Physics, 
Durham University, DH1 3LE, United Kingdom 

After the recent discovery of a Standard Model Higgs boson-like particle at the LHC, the question 
of its couplings to known and unknown matter is eminent. In this letter, we present a method that 
allows for an enhancement in S/B (irreducible) of the order of 100% in pp — > (h — > for a 

center of mass energy of 8 and 14 TeV. This is achieved by applying the matrix element method. 
We discuss the implications of detector resolution effects and various approximations of the involved 
event simulation and reconstruction. The matrix element method provides a reliable, stable, and 
efficient handle to separate signal from background, and the gluon and weak boson fusion components 
involved in this process. Employing this method, a more precise Higgs boson coupling extraction 
can be obtained, and our results are of immediate relevance for current searches. 



I. INTRODUCTION 

The performance of the CERN LHC and the rapid col- 
lection of data by both ATLAS and CMS will allow for 
detailed studies of the properties of the recently discov- 
ered boson [l], in the coming years. The first reported 
observations of the production cross section and branch- 
ing fractions are consistent Q with the expectation from 
a Standard Model Higgs boson [U with the observed mass 
rrih — 126 GeV, but sizeable uncertainties caused by the 
limited statistics are still present. 

The projected amount of data collected this year will 
not only reduce these uncertainties, but may also allow 
for the study of properties beyond total rates. 

In many extensions of the Standard Model the exis- 
tence of CP-odd bosons are predicted. Therefore, study- 
ing the CP properties of the boson is of great importance 
to confirm if the mechanism of electroweak symmetry 
breaking is minimal or not. A detailed study of the kine- 
matic distributions in the production of the boson in as- 
sociation with two jets 0, |(| will eventually allow for the 
direct measurement of the CP of the boson 0, H| ■ 

Equally important are direct measurements of the 
Higgs couplings to all other particles of the Standard 
Model. In particular, the loop-induced /177 vertex is sen- 
sitive to new degrees of freedom, where direct production 
might be beyond the energy reach of the LHC. The num- 
ber of observed signal events depends on the sum of the 
production processes and the decay branching ratio: 



a(h)-BR( 77 )~(J^gl) 
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(1.1) 



assuming no interference between the different produc- 
tion channels p, where g denote the involved Higgs cou- 
plings. The sum in the denominator runs over all kine- 
matically accessible final states in the decay. The pre- 
cision in measuring any coupling of the Higgs boson ob- 
viously benefits from separating the production mecha- 
nisms. 

In this respect pp — > hjj is important for two reasons. 



First we observe an excess in h — > 77 in current analyses, 
especially in the 2j category [l|. Furthermore, an anal- 
ysis of the h — y tt channel heavily relies on the 2j final 
state. For PV_-^ (h — > Tr)jj we currently observe under- 
production [2j . If both these findings prevail and follow 
from new physics beyond the SM, a precise investigation 
beyond simple ratios, which will also incorporate mod- 
ifications of the GF and WBF production modes, will 
major ly depend on WBF/GF separation in this channel. 

pp — > (h — > + X is roughly composed 50:50 by 

gluon fusion (GF) and weak boson fusion (WBF), de- 
pending on, e.g., the cut on the invariant dijet mass. In 
this context, it is important to note that the quantum 
interference between the GF and WBF components of 
pp — > hjj+X production is completely negligible [s|, and 
hence it is possible to consider these two contributions 
separately. Also, the theoretical uncertainty of the GF 
contribution is much larger than it is for WBF produc- 
tion fiol. |Tll|. especially within the WBF-selection cuts. 

In this letter, we construct a likelihood function which 
is based on WBF and GF matrix elements which pre- 
cisely serves this purpose to separate the production 
mechanisms, i.e. we apply the matrix element method 
to the pp — > (h — > 77) j j process on the fully- 
showered and hadronized final state. We also general- 
ize the GF vs. WBF vs. background discrimination in 
a realistic analysis by including the pp — > 77 + jj + X 
matrix elements. This constitutes the main irreducible 
backgrounds to this search [T|. Employing this strategy 
allows to optimize cut scenarios to enhance signal-over- 
background S/B. Given that the irreducible pp — > jyjj 
constitutes 0(70)% of the total background |l3[ an ad- 
ditional handle to reduce it will greatly improve the sen- 
sitivity of the experimental analyses. 

We organize this work as follows: First we discuss the 
matrix element strategy for pp — > (h — > + X for 

the LHC v/s = 14 TeV in Sec. HI This is the setup, for 
which most precision Higgs results can be obtained from 
a large luminosity sample C > 300 fb~ /experiment. We 
discuss the systematic uncertainties of various approxi- 
mations of the signal event generation, especially for the 
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FIG. 1: Exclusive number of jets distribution for the LHC 
running 14 TeV. The selection cuts are described in the text. 



either of the two hypotheses in the best suitable way, by 
definition. By including Q n to the event selection, we 
can optimize Q n ^ (Q n ) WBF depending on the pu- 
rification requirement. The expectation values (.) follow 
from MC simulations of signal and background, similar 
to the construction of simple binned log-likelihood ratio 
hypothesis tests [To] ]. 

We apply the effective top approximation in the fol- 
lowing for the GF contribution and the h — ¥ 77 decay 
via operators [l6| 
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2ttv 



F^F^h (e 2 - 7/4) . (2.2) 



GF contribution, and we also include the impact of finite 
detector resolution. 

Equipped with these insights, we discuss the impli- 
cations of the matrix element method for pp — > (h — > 
+ X for the 8 TeV run and current results [l|, 
in Sec. HID We conclude this work with a summary in 
Sec. ED 



II. HIGGS IN ASSOCIATION WITH TWO JETS 
AT 14 TEV 

A. The Matrix Element Method 

Let us introduce the observables that we are going to 
study in the remainder of this paper. The GF/WBF 
discriminating likelihood is defined 

Qn( P J,pl{K}) 

[ LIPS(77J n ) \M WBF (pp ^(h^ 77)j")r 
° g [ LIPS( 7 7j") \M GF (pp -►(/»-»■ 77)j n )| 2 . 

1n r i» WBF (pp^(fe^77)j")i 2 i f211 

g [ \M GF {p P ^ (/W77)j")l 2 J ' 1 ' ' 

where we denote the (partonic) jet multiplicity by n, 
LIPS is the differential phase space weight for the par- 
ticular kinematics (which is identical for all processes we 
consider and hence drops out of the ratio), and \M.\ 2 de- 
notes the respective matrix elements, already include the 
parton distribution functions (pdfs). We implement the 
CTEQ611 set [3 in the ratio of Eq. ([23]) . 

Eq. (|2.ip provides a one-dimensional probability dis- 
tribution, which expresses statistical compatibility with 



New degrees of freedom which alter the GF contribution 
and/or modify the Higgs couplings to weak bosons can 
be included as a global factor in the ratio of Eq. (|2.ip . 
A global factor merely shifts the Q by a constant fac- 
tor, which is irrelevant for the probabilistic discrimina- 
tion of GF vs. WBF. The model-specific details enter the 
sampling of (Q n ) WBF and in the individual normal- 
izations. 

The matrix elements that enter Eq. (|2.ip arc functions 
of parton-level kinematics and we have to define an al- 
gorithm which maps the fully showered and hadronized 
jet final state to a suitable set of (massless) kinemat- 
ics, which also includes information about the events ini- 
tial state. We do this in the following way: We first 
cluster jets with Fast Jet [TtJ and reconstruct the iso- 
lated photons according to the respective analysis re- 
quirements (see further below). We count the number 
of jets passing the pr,j threshold in the events. We then 
re-distribute the transverse recoil against unresolved ra- 
diation. The jets' momenta along the beam axis we re- 
construct from massless calorimeter cell entries at a given 
pseudo-rapidity. We scale the energy of the resulting ob- 
jects such that p 2 = 0. From the sum of these objects, 
we get an overall energy reconstructed boost of the con- 
sidered particle system, which allows to define two mo- 
mentum fractions of initial state momenta. This way, 
starting from an exclusive number of reconstructed jets 
rij and photons n 7 , that comply with the analysis require- 
ments we end up with a set of parton level four momenta 
which we use for the calculation of the ratios Q n . This 
procedure is obviously not limited to MC studies and can 
be incorporated by experiments straightforwardly. 

To discriminate signal from background we generalize 
Eq. (|2~T|) to the S/B, S/\/i?-irnproving likelihood 



{\M WBF ( PP -> (h -> 77 )j")| 2 + \M GF (p P -> (h -> 77)J")| 2 } " 
|7W 2 T(pp-> 77 j")| 2 
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FIG. 2: 2-jet and 3-jet likelihoods Q2,3 for the cuts as described in the text and *J~s = 14 TeV. We show the influence of various 
event generation modes, where "2j" refers to generating pp — s> Ty'j — > 77JJ events from 2 jet matrix elements+parton shower, 
"matched" refers to a matched 2j/3j sample, and "full t, b" stands for 2-jet events, including the full one loop mass dependence, 
interfaced to the parton shower. We also show the influence of detector and photon resolution effects. 
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For the numerical implementation of Eqs. (|2.ip and 
(|2.3[) we rely on a combination of Mad Graph v4 [3 
and Vbfnlo [II] . 



B. Event generation and selection 

We generate two and three jet CKKW-matched [2(| 
and un-matched samples with Sherpa [2l| , which imple- 
ments the effective top approximation in the gluon fusion 



channels |22j. The events are generated with Sherpa's 
default CT10 [23| pdf set to avoid biasing the analysis of 
the likelihood distributions. 

It is known that the effective theory does not pro- 
vide a valid description of the phenomenology as soon 
as we are sensitive to momentum transfers larger than 
the top mass, e.g. prj > m*. Cross sections, on the 
other hand, are reproduced at the percent level, which 
follows from smaller effective theory cross sections for 
PT,h ^ m t cancelling the excess with respect to the 
full calculation for ptm ^ rn t . For large momentum 
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the remainder of this section, we therefore adopt a cut 
setup which is based on phenomenological analyses (see 
e.g. Refs. @, HI, E3 ) ■ In Sec. (jm]) we employ the current 
8 TeV selection of ATLAS Q. 

We reconstruct kr jets [H[ with D = 0.8 and threshold 
Pt,j > 30 GcV in the pseudo-rapidity range \rjj\ < 4.5, 
and require nj > 2. The exactly two isolated photons are 
required to be central in the electromagnetic calorimeter 
|?7 7 | < 2.5 with pt,7 > 30 GeV (we define a photon to be 
isolated if the electromagnetic and hadronic activity in 
the cone with size R = 0.3 is less then 10% of the Et of 
the photon candidate). The photons have to reconstruct 
the Higgs mass = 126 GeV within 115 GeV < (p 7l i + 
Pj,2) 2 < 135 GeV. On top of these generic cuts we impose 
a typical WBF selection: The two jets, leading in px are 
required to have a large invariant mass rrij 1 j 2 > 600 GeV 
and have to fall into opposite detector hemispheres yj 1 x 

Vh < o. 
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FIG. 4: pp — > hjj production via gluon fusion (including full 
top and bottom contributions) broken down to the partonic 
channels and comparison of WBF (at NLO QCD) vs. GF as 
a function of the cut on the invariant dijet mass. Results are 
obtained with Vbfnlo [Ts| . 



transfers PT,h ~ 300 GeV the deviations become larger 
than O(30%). To fully capture the dynamics in a two- 
dimensional likelihood, which extends Eqs. (|2.1j) . (|2.3j) 
by the inclusion of the events energy scale would need to 
incorporate the full mass dependence and a parametriza- 
tion of potential new physics. 

To test our effective theory approximation (note that 
we both simulate and analyze the event within the ef- 
fective approximation), we also analyze a GF event sam- 
ple obtained with Vbfnlo [IH, which includes the full 
top and bottom contributions. We pass the parton-level 
events to HerwigH — h [24[ for showering and hadroniza- 
tion. This allows us to asses the bias of analyzing the 
events with effective theory matrix elements. 

Finite detector resolution effects give rise to an addi- 
tional systematic uncertainty when we want to analyze 
the discriminating power of the matrix element method. 
To assess their impact on Eq. (|2.1[) and (|2.3[) . wc model 
jet momentum uncertainties according to Ref. (25| : 
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For the photons we have performed a comparison of 
the invariant diphoton mass against the experimental 
results of 0, an d find that a resolution uncertainty 
parametrized by 3% describes the experimental situation 
in sufficiently well for our purposes. 

Details of the 14 TeV cut set-up are currently un- 
known; they will depend on the new pile-up and under- 
lying event conditions as well as on the potential excess 
in the pp — > (h — > + X signal cross sections. For 



C. Performance of Q n 

For these 14 TeV selection criteria we show the nor- 
malized distribution of the exclusive number of jets for 
14 TeV in Fig. [T] (for a detailed analysis of this observable 
see Ref. (29[). The cross sections are er GF ~ 0.61 fb and 
a WBF ~ 1.58 fb. From this we see that we can limit our 
analysis to Q n , Q h n for n — 2, 3. Higher jet multiplicities 
contribute at a level which is challenged by the theo- 
retical uncertainties of the inclusive p — > hjj + X cross 
sections [ljj [LH ■ The extension to n > 3 is technically 
straightforward. 

We show our results in Fig. O An immediate first ob- 
servation is that neither the definition of the likelihoods 
nor the impact of cither detector resolution or details 
of the event simulation have a significant impact on the 
discriminating power of Q n= 2,3- Keeping in mind that 
we already look at WBF-like events and have a recon- 
structed Higgs boson, a cut on Q2 will significantly en- 
hance the WBF contribution over GF production. This is 
even more the case for Q3. This can be understood along 
the following lines. Additional jet radiation in the WBF 
component* is essentially QCD bremsstrahlung off the 
leading jets since there is no color exchange between the 
quark legs. This is entirely different for the GF contribu- 
tion, which tends to populate the large available phase 
space in the central region with QCD activity [3l|. Not 
only the presence of this radiation [32| . but, more impor- 
tantly, the information that is encrypted in the differen- 
tial energy-momentum flow associated with it 0] provides 
an elaborate handle to separate WBF from GF. This is 
most efficiently reflected in Q n >3, of which only n = 3 is 



'For a detailed discussion of this contribution beyond LO see 
Ref. [H. 
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FIG. 5: 2-jet and 3-jet likelihoods for the cuts as described in the text and \fs = 14 TeV. Detector resolution effects are 
included. The lower panel gives results for a loosened cut set with rrijj > 400GeV. 



statistically relevant^. 

The result of GF/WBF purification is depicted in 
Fig.[3J Going to large values |Qa,3| 3> 1, we recover a high 
level of purification within the limits of the selection cri- 
teria as expected, a GF /a WBF < 0.1, a GF /cr WBF > 3, re- 
spectively. Comparing to Figs. [3J this comes at the price 
of increasingly small signal cross sections. A more realis- 
tic analysis (that also includes a more realistic description 
of the detector environment, pile-up, etc.) needs to opti- 
mize the signal rate with respect to purification. Apply- 
ing multiple cuts on Q n for an identically chosen basic cut 
set-up, on the other hand, can be used to define distinct 
signal regions which allow to reconstruct the WBF/GF 
content upon correlating the two measurements. 

This finding needs to be contrasted to the standard 
paradigm of WBF/GF separation via a larger rrijj cut. 
In Fig. |U we show the relative fraction of full one-loop 
GF vs. NLO QCD WBF for ^fs = 14 TeV as a function 
of the imposed rrijj selection. We see that the emerging 
quark-induced processes at large x for a stiff rrijj cut sat- 
urate the relative fraction at about 15%. The resulting 
event topology is WBF-like at small total cross sections. 
Central jet vetos can further decrease the relative contri- 



^The event shape observables discussed in Ref. Q also capture sen- 
sitivity from relatively soft radiation which, by definition of Q n as 
jet based observables is not considered. 



bution, but challenges the perturbative description 



We now turn to signal vs. background discrimination. 
In Fig. [5] we show the distributions for Q\ 3 for our 

\fs = 14 TeV selection. Obviously, by cutting on Q\ 3 , we 
can purify the signal+background sample without loos- 
ing too much of the signal count. This way the signal 
compared to the irreducible background can be increased 
by > 100%. It is worth noting that this will affect the 
GF and WBF contribution almost identically (this is the 
reason why we choose to plot WBF and GF as separate 
samples in Fig. [5]). 

In order to gain a better handle on the GF distribu- 
tions we can relax the rrijj cut such that the event is not 
forced into a WBF-type topology. In Fig.|H we also show 
distributions for rrijj > 400GeV. As expected, the differ- 
ence in the WBF/GF samples is now more pronounced 
in Q h n . The irreducible 77 background grows by approx- 
imately a factor of two while the gluon fusion and weak 
boson fusion cross sections are 0.95 fb and 2.1 fb. This 
is an increase by factors 1.6 and 1.3 respectively, com- 
pared to the rrijj > 600 GeV selection. We find similar 
WBF/GF separation properties as in Fig. [3] Note that 
the irreducible background grows disproportionally, and 
a signal vs. background enhancement will vitally depend 
on a good S/B discriminator, which is exactly provided 

by Ql 
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FIG. 7: Exclusive number of jets distribution for LHC 8 TeV. 



III. APPLICATION TO HIGGS IN 
ASSOCIATION WITH TWO JETS AT 8 TEV 



We can straightforwardly adopt the strategy of 
Sec. UTAl to the current 8 TeV setup. The ATLAS se- 
lection for the two jet category of the h — > 77 search is 
as follows Q: We cluster anti-fcr jets [H] with Fast- 
Jet [13] for D = 0.4 and select at least two jets with 
Pr,j > 25 GeV and prj > 30 in the more forward region 
2.5 < < 4.5. The hardest jets are required to have 
a rapidity gap IAt^I > 2.8 and the dijet system has 
to recoil against the diphoton system in the transverse 
plane A(f)(jj, 77) > 2.6. Again as in Sec. |TT]we require 



a Higgs mass reconstruction within 20 GeV interval cen- 
tered around rrih = 126 GeV. 

The exclusive number of jets for this selection is again 
shown in Fig. [7] and we find agreement of our analy- 
sis with the experiment's quoted number of 3 expected 
events in 4.7 fb _1 . Obviously, there is again no need to 
go beyond n = 3. 

Finally we again analyze the potential S/B improve- 
ment (where B refers to the irreducible background for 
our purposes), which is the key limiting factor when deal- 
ing with the small event rates for the 8 TeV run. Fig. [5] 
shows a similar behavior as Fig. [SJ we infer that we can at 
least gain a factor of 100% in S/B without cutting into 
the signal count in the currently applied selection. All 
remarks of the 14 TeV results generalize to the lower en- 
ergy of 8 TeV, and again the GF and WBF signals rates 
are affected identically by selecting events according to 



IV. SUMMARY AND CONCLUSIONS 

In this paper, we have applied the matrix element 
method to pp — > (h — > production and investi- 

gated the prospects to separate the GF and WBF contri- 
butions. This is of utmost importance for CP analyses of 
the newly discovered particle, as well as for the measure- 
ment of its couplings to known matter. The same method 



7 



can be applied to other decay modes of the Higgs boson, 
e.g. h — > tt. 

We find that the matrix element method provides an 
excellent discriminator, which is stable against finite de- 
tector resolution effects and event generation system- 
atics. We have extended WBF/GF separation to sig- 
nal vs. (irreducible) background discrimination an find 
promising results. WBF/GF separation and enhanced 
signal vs. background discrimination can be achieved si- 
multaneously. Our results are directly relevant for the 
current 2-jet category of h — > 77 analysis, improving 
the sensitivity to the signal over the irreducible 77 back- 
ground by > 100%, thus leading to a potential sensitivity 
increase by 35% when reducible backgrounds are consid- 



ered unaltered. 

Already in the presently available data sets using Q„ 
the event selection cuts can be relaxed to boost the signal 
event count, and thus significance, without affecting S/B. 

Selection strategies which are based on Q„ in a exper- 
imentally realistic analysis will allow to measure Higgs 
properties to better precision than currently foreseen. 
The implementation of the method will become publicly 
available to the experiments. 
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